rank | frequency | n-gram |
---|---|---|
1 | 15322 | -ा |
2 | 10625 | -ो |
3 | 5775 | -े |
4 | 5086 | -ी |
5 | 3918 | -र |
rank | frequency | n-gram |
---|---|---|
1 | 9149 | -को |
2 | 5908 | -मा |
3 | 4090 | -का |
4 | 3554 | -ले |
5 | 2817 | -ाई |
rank | frequency | n-gram |
---|---|---|
1 | 2575 | -लाई |
2 | 1457 | -बाट |
3 | 1253 | -हरू |
4 | 1203 | -ाको |
5 | 810 | -ामा |
rank | frequency | n-gram |
---|---|---|
1 | 703 | -रूको |
2 | 395 | -देखि |
3 | 393 | -रूले |
4 | 392 | -ालाई |
5 | 379 | -रूमा |
rank | frequency | n-gram |
---|---|---|
1 | 697 | -हरूको |
2 | 390 | -हरूले |
3 | 374 | -हरूमा |
4 | 353 | -्वारा |
5 | 332 | -रूलाई |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings